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ABSTRACT 

The detection of galaxy clusters in present and future surveys enables measuring 
mass-to-light ratios, clustering properties, galaxy cluster abundances and therefore, 
constraining cosmological parameters. We present a new technique for detecting galaxy 
clusters, which is based on the Matched Filter Algorithm from a Bayesian point of 
view. The method is able to determine the position, redshift and richness of the clus- 
ter through the maximization of a filter depending on galaxy luminosity, density and 
photometric redshift combined with a galaxy cluster prior that accounts for color- 
magnitude relations and BCG-redshift relation. We tested the algorithm through re- 
alistic mock galaxy catalogs, revealing that the detections are 100% complete and 
80% pure for clusters up to z <1.2 and richer than Acl >20 (Abell Richness ~0, 
M~4xlO 14 M0). The completeness and purity remains approximately the same if we 
do not include the prior information, implying that this method is able to detect galaxy 
cluster with and without a well defined red sequence. We applied the algorithm to the 
CFHTLS Archive Research Survey (CARS) data, recovering similar detections as pre- 
viously published using the same or deeper data plus additional clusters which appear 
to be real. 

Key words: large-scale structure of Universe - galaxies: clusters: general - galaxies: 
abundances - galaxies: luminosity function, mass function - cosmology: theory. 



1 INTRODUCTION 

Galaxy clusters are the largest virialized structures in the 
Universe. They are valuable laboratories for investigatin; 
environmental effe cts on galaxy evolution dTreu et al.ll200 
iMoran et al.l 120071 ') as well as the properties of their dark 
matter content. In addition, the evolution of their global 
properties through time and distance are of great impor- 
tance to constrain the cosmological parameters of the Uni- 
verse. Therefore, it is important to define samples of clusters 
which span a wide range of redshift and cluster mass with a 
minimum of false positives while remaining as complete as 
possible. 

Achieving this goal requires very good wide-field survey 
data as well as very good algorithms. In the past decade, 
survey data have vastly improved both in terms of quan- 
tity (SDSS, DLS, CFHTLS...) and quality (depth, resolu- 
tion, wavelength coverage, and uniformity). Additional sur- 
veys recently begun or under construction including Pan- 
STARRS, KIDS, DES, HSC and LSST will deliver even more 
and better data. In this context, it is important to develop 
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cluster-selection algorithms which take maximum advantage 
of the large amount of information available. 

A number of methods to detect cluster s have been 
devel oped based o n cluster X-ray emission dRosati et all 
2002h . weak lensing l| Tyson et al.lll990l : IWittman et al.ll200ll, 
20051) a nd the Sunyaev-Z e ldovic h effect (SZ:ICarlstrom et all 
i|2002h ; I Ascaso fe Moles! (120071 ); iMenanteau et all i|2009l) ). 
Furthermore, the cluster detection methods based on op- 
tical data have provided a large dataset of clusters. They 
take advantage of the larger number of bands, better photo- 
metric redshift quality or improvement in depth of the new 
surveys. We summarize below the main optical cluster de- 
tection methods in three main groups based on the use of 
different optical characteristics. 

The first group of methods is based on the geomet- 
ric distribution of the galaxies. They usually create den- 
sity maps based on the spatial distribution of the sources. 
The candidate clusters are identified as overdensities over 
the mean backgrou nd. These method s include the Counts 
in C ells method (|Couch et al.l Il99ll : 
19961). the Percola tion Algorithms 



Lidman fc Peterson! 
i Dalton et all Il997l : 



Botzler et all 12004) . the Voronoi Tessellation algorithm 



I Ramella et all 120011 : iKim et all 120021 : lLopes et all 12004 ) 
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or the Friends of Friends Algorithm dRamella et all |2002| ; 
iBotzler et ail 12004 Ivan Breukelen fc Clewlevl l2009h . They 
have the advantage that they do not bin data or make 
any assumption prior to the detection process. Thus, non- 
regular structures can be detected. However, these meth- 
ods usually have large false de tection rates (| Couch et al.l 
Il99ll : iLidman fc Peterson! Il996l) , or do not go as deep as 



Matched Filter dMilkeraitis et al.ll2010l ). They h ave been ap- 
plied to NoSOCS jGal et al.ll200Cl). CFHTLS JOlsen et all 



other methods (Kim et al 



■99a) . or do not go as deep as 
20021 ; lLopes et all 120041 ). Some 



variants have been introduced l ately taking into account 
photo metric redshifts or c o lors jKim et al.l 20021 ). magni- 
tudes dRamella et alj l200ll ; lLopes et alj |2004| ) or redshifts 
l|Wav et al.l |2010|) in the Voronoi Tessellation Method to 
minimize the background and foreground contamination. 

The second group takes into account the presence 
of the red sequence of the clusters and the fact that 
the early type population, including the brightest clus- 
ter galaxy (BCG), of the clusters h ave predictable co l- 
ors and in some cases, magnitudes (|Ascaso et al.l l201ll ). 
Some algorithms based on these proper ties are the Cut- 
and-Enhance Algorithm ( Goto et al. 20021), the Cluster Red 
Sequence Method (Gladdcrs & Ycc 20 05 lLopez-Cruz et all 
20041; iGladders fc Yeell2005l) . MaxBCG l|Hansen et al.ll2005l ; 



Koester et al 



[ Miller et al 



2007) and the C4 cluster-finding algorithm 
2005). Some of these algorithms have been 



successfully applied to surveys such as SDSS or the Red 
Sequence Cluster Survey (RCS). Generally, those methods 
generate a set of color slices based on a particular model and 
select subsets of galaxies to belong to each slice. Then, peaks 
in the density maps are selected. Those algorithms work well 
in sufficiently deep photometric catalogs and provides robust 
photometric z estimates. However, the presence of the red 
sequence in all kind of clusters is still a matter of debate. Al- 
th ough many stud i es have proved it to exist up to redshift ~ 
1 dMei et al.ll200d ; IXscaso et alj|2008l; lAndreon et alj|2008l ; 
iMei et al.ll2009l ). others (jPonahue et ; al J l2002h argued that 
the red sequence could be a characteristic only for virialized 
massive clusters while less massive clusters may not show 
it even at moderate redshifts. Thus, this method could be 
biased towards the high end of the mass function. 

A higher redshift version of this algorithm has been im- 
plemented with SpARC S (Spitzer Adaptat ion of the Red- 
Sequence Cluster Survey; [Wilson et al.l l2008j), which samples 
the 4000A break by analyzing Spitzer 3.6 micron and z band 
data. So far, some clusters have been confirmed spectroscopi- 
cally l| Wilson et al.ll2009l ; lMuzzin et aHl2009l ; lDemarco et all 
2010). However, the main concern about this method is the 
possible bias towards clusters showing the red sequence fea- 
ture. 

Finally, the third group includes those methods where 
different cluster characteristics such as the luminosity or 
density are taken into account to model the clusters. In 
general, the density and luminosity distributions of galax- 
ies are first modeled for different redshift slices and conse- 
quently probability density maps are generated. Then, clus- 
ters are selected as the peaks of these maps a nd their redshift 
and richnes s estimated. The Matched Filter (|Postman et al.1 
Il996ll2002h has been applied to detect clusters in different 
surveys such as the Palomar Distant Cluster Survey. Some 
later modifications include the Adaptive Mat ched Filter 
dKepner et al.lll999l ). the Hybrid Matc hed Filter (iKim et all 
2002), the Simple Smooth ing Kernels (IShectmanll 19851 ). the 
Adaptive Kernel method (|Gal et al.ll2000l , 120031 ) or the 3D- 



l2007l; iGrove et alj 120091; iMilkeraitis et al.ll2010l ) and SDSS 
|Dong et al.ll2008l ; ISzabo et al.l 120101 ). These methods usu- 
ally recover a high completeness rate and small contam- 
ination. However, the results might be model dependent. 
A slightly different algorithm that can be included in 
this group is the Surface Brightness (SB) Enhancements 
j|Zaritskv et al.lll997l, I2002T ). which consists of detecting the 
localized cumulative SB enhancement due to unresolved 
light from galaxies in distant clusters. 

Another important variation of this technique at hig h 
redshift has been introduced by lEisenhardt et all ([2008) . 
They used 3D overdensities based on the photo-z distribu- 
tion. In their simulations, they found that only 10% of the 
detections are spurious. They identified 106 galaxy clusters 
at z>l in 7.25 square degrees in the Spitzer Infrared Array 
Camera (IRAC) Shallow Survey. To date, they have con- 
firmed 12 of them spectroscopically within a redshift range 
1 < z < 1.41. 

In addition, new ways of detecting methods at higher 
redshift have been developed for de tecting galaxy clusters 
around radio galaxie s at z~ 1.5-2 (|Galametz et al.1 120091 ; 
IChiaberge et alj|20ich . 

The motivation of this work is to take advantage of 
all the characteristics of the present methods by modeling 
each cluster property. There might be some cluster prop- 
erties that do not show up in certain ranges of redshift or 
mass, for example the CMR, but we still want to detect a 
cluster if one of these properties is not present. To accom- 
plish this, we have designed a Bayesian cluster finder where 
each galaxy is assigned a Bayesian probability that there is 
a cluster centered in that galaxy at a given redshift. This 
method takes into account cluster properties such as the 
cluster luminosity function, density profiles and photomet- 
ric redshift distribution to generate a likelihood probability 
as a matched filter technique variation. In addition, we have 
included cluster properties such as the presence of a red se- 
quence and the BCG magnitude - redshift relation as a prior 
term. Although these are properties that have been well ob- 
served, it is not clear that they exist and are present in every 
kind of cluster. With this formalism, we can easily turn on 
or off the prior term and as a result, be able to detect any 
kind of cluster. The uniqueness of this method comes from 
the inclusion of different sets of cluster properties. 

We have tested the algorithm extensively on simula- 
tions. We have derived the completeness and purity rates 
of the detections and we have studied the improvement or 
dependence of our detections on the introduction of such a 
prior. Moreover, we have also tested the algorithm on real 
data. We have applied the algorithm to CFHT LS- Archive- 
Research Survey, (CARS, lErben et all (I2009D) an d com - 
pared our results wi th those obtained bv lOlsen et al.1 l|2007l ). 
[Ad ami et al.1 l|2010l ) and X -ray spectroscopically confirmed 
detections in the same fields. We find the results to be in 
fairly good agreement, providing support to our results. 

The structure of the paper is as follows. In section 2, 
we describe the method we have implemented. Section 3 is 
devoted to simulations to study the completeness, purity 
and false positive rate of the re sults. Section 4 sh ows the 
cluster detections in CARS data (|Erben et al.ll2009h and the 
comparisons with other works. In Section 5, we include a 



Bayesian Cluster Finder -.Clusters in the CFHTLS Archive Research Survey 3 



summary and discussion of the results. Where appropriate, 
we have used the Acdm cosmology Hq=71 km s _1 Mpc~ , 
Oi\f =0.23, S1l=0.73 throughout this paper. 



2 THE METHOD 

We consider that a cluster in an (X, Y) position, can be 
defined by N g , R c and z c , that is the number of galaxies in a 
given aperture ('richness'), the cluster radius and a redshift. 

The set of galaxies in the field or in a cluster can be 
described by their coordinates (xi,yi), photometric redshift 
Zi (including color information) , magnitudes mi and spectral 
types U estimated from a photometric redshift algorithm. 

The probability for the existence of a cluster centered 
on the position (X, Y), given a dataset D and a priori infor- 
mation 7 is 



p(X,Y,N g ,Rc,z c \D,I) 



(1) 



Finding this probability is proportional (applying 
Bayes' Theorem) to 



p{X,Y,N 3 ,R c ,z c \D,I) 

oc p(X, Y, N g ,R c , z c \I)p(D\X, Y, N g ,R c , z c 

where the first term 

p{X,Y,N g ,R c ,z c \I) 



(2) 



(3) 



is the prior, or the information which is not implicit in the 
data, such as the distribution of the cluster properties, clus- 
ter redshift and spatial cluster distribution, and the second 
term is the likelihood 



p(D\X,Y,N g ,R c ,z c 



(4) 



which is the probability of observing galaxy data D if there 
is a cluster in (X, Y) with z c , N g and R c (also known as the 
likelihood) . 

The likelihood can be rewritten as 



L(X,Y,N g ,R c ,z c )=p(D\X,Y,N g ,R c ,z c ) = 

N 

TTp^ijJ/i, Zi, mi,ti\X, Y, N g , R c , z c ) 

i=l 

or similarly, as 
\nL{X,Y,N g ,R c ,z c ) = 

Si=i ^°-P( x i,Vi, z z ,m z ,ti\X, Y, N g ,R c , z c ) 



(5) 



(6) 



That is, the product of the likelihood for each galaxy, 
where the likelihood is the probability of observing a galaxy 
(xi,yi), with a given redshift, magnitude and spectral type, 
if there is a cluster at the (X, Y) position with N g , R c and 

Z c . 



2.1 Likelihood 

The likelihood or predicted density probability (n) for a clus- 
ter can be described by 



\nL(X,Y,N g ,R c ,z c ) = 

^ . n(xi,yi, Zi,rrii,ti\X, Y, N g ,R c , z c ) = (7) 
^2 . P{xi, yi\X, Y, R c , N g ,z c ))L(mi, Zi\z c )p(zi,ti\z c ) 

where P(r) is the cluster radial profile. The sum is made 
over all the galaxies within a radius of r cut from the central 
one and within the limits of reliability of the photo-z. 

We use the Plummer profile to model P(r) 
l|Postman et al.lll996l ). 



P(r) = <^ V 1 +( r / r o) 2 v /l +('-cW>-c) : 





where r is calculated as 

r(xi,yi\X,Y,z c ) = D A (z c )e iC {x t ,y 1 \X,Y) 



if r < Tent 

if r > r cu t 



(8) 



(9) 



where Da(z c ) is the angular diameter distance at the clus- 
ter's redshift z c . 

We choose the core and cut radius as r c = 0.150 Mpc 
and r C ut = 10r c respectively. 

As far as the luminosity filter is concerned, we have 
adopted a total luminosity profile described by 



L(m,i, Zi\z c ) = b(rrii) + 4>(mi, Zi\z c ) 



(10) 



where b(rrii) is the number of background galaxies and 4>(L) 
is th e standard Schechter luminosity function (Schechter 

Eazi) 

<i>{U,z % \zc) = 0.4(\nlO)n{L l /L*) 1+a exp(-L t /L*) (11) 

being Li = Li(rrii, Zi) the luminosity for each galaxy within 
rc and L* — L*(z c ) the characteristic luminosity correspon- 
dent to the luminosity function of a cluster at a given red- 
shift z c . We have applied k-correc tions to the magnitudes b y 
using the K-correct package from lBlanton fc Roweisl (|2007h . 
We selected the valu es of M*(0)=-20.44 and a = -1.05 from 
iBlanton et ail (|2003l ) in r band for local clusters, where M* 
is the magnitude associated to L* . We also applied the fol- 
lowing evolutionary correction to M* , 

M*{z) = M*(z = 0)-z (12) 

extracted from iPostman et~al1 il2002D which is consistent 
with observations I Postman et al. I l200ll ; iNakat a et aljfeooil ; 
iBrown et alll2007l ). 

We estimate the background galaxy counts b(m) in any 
given dataset in two iterative steps. We first detect galaxy 
clusters by using the backgr ound counts extracted from 
GOODS in R band fromH arsono fc DeProprisl {2009) as an 
initial estimate. Then, we obtain the survey's background 
galaxy distribution by masking out these detections from 
the survey. Finally, we run the detection algorithm again 
with the right background distribution. 

The uncertainties on galaxy redshifts can be included 
in the probability with the following factor 



P(Zi\Zc) = 



PPDF(Zi\z c )dz 



(13) 



where ppdf is the redshift probability distribution function 
for each galaxy obtaine d by any photometric redshift code 
like BPZ (|Benftezll2000l ). The limits of the integral are set to 
account for the spread of the redshifts of the galaxy cluster 
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and their photometric errors and the fact that the estimated 
redshift could be shifted from the actual redshift value by 
Az/2, where Az refers to the difference between two consec- 
utive redshift slices. We have chosen a c — (1 + z c )0.06. The 
actual value of a c is set from the data quality of the survey 
that we are simulating. 

In case the ppdf for each galaxy is not available, we 
can model p(Zi\z c ) by ass uming a Gaussian distribution for 
the redshift distribution (|Dong et al.l 120080 with the same 
dispersion as before, er c . 



p(zi\z c 



exp[ 



-{zi-z c y/2a}]/V2r, 



(14) 



2.2 Prior 

The prior probabilities for the cluster distributions refer to 
already known properties of clusters that can help enhancing 
the likelihood. We decompose them into two different parts. 

The first term models the probability of a spatial group- 
ing of galaxies having the expected red sequence of a clus- 
ter at its redshift. This probability can be quantified as the 
departure of the data from the expected red sequence of 
clusters at this redshift. 



p(coh) = exp[-(cok - col(z c ))/2al oh 



(15) 



where coU refers to the slope of the color-magnitude relation 
for the galaxies within a 1.5 Mpc and within a photometric 
redshift range of z c ± Az/2 ± 0.01(1 + z) and col(z c ) is the 
expected slope at a particular redshift slice z c . 

We estimate synthetic color-magnitude relations com- 
puting the expected colors from a set of templat e spec - 
tra of E/S0, Sbc, Scd, and Irr by I Coleman et HI dl98Gh 
and o f starbursting galaxies SB3, SB2 by iKinnev et al.l 
(1996) and adding a fixed slope obtained from an individ- 
ual well-characterized galaxy clusters in the survey being 
modeled. We selected c olors that sample the 400 0A break 
jGladders fc Yed |200C| . 120051 ; IWilson et all |2006| ) for each 
redshift slice. For instance, we chose g — r color for z s ^ 
0.3 and r — z color for z 3 > 0.3 for the CARS simulation. 
The expected g — r and r — z CMR for each redshift slice 
are shown in Figure [T] 

The second term takes into account the fact that each 
cluster has a brightest cluster galaxy (BCG) and that there 
is a tight relation between the apparent magnitude of the 
BCG and t he redshift of this BCG and therefore the clus- 
ter redshift jGladders fc Yedl200d ; iKoester et alj|2007j ). We 
obtain an empirical BCG magnitude- redsh ift relation by 
fittin g the 13823 BCGs sample from SDSS |Koester et all 
12007ft and extrapolating it to higher redshift. 

We model this term in the prior with an additional 
Gaussian term, 



p{m B CG{z)) = 



exp[— (trij — m B CG(z)) 2 /2al 



2lY(T ri 



(16) 



where rrn refers to the magnitude of the brightest galaxy 
within 1.5 Mpc aperture from each galaxy, 771bcg{z) is the 
expected magnitude of the BCG at the expected cluster red- 




shift and <j„ 



refers to the magnitude-redshift dispersion 



obtained from the empirical fit. 



Figure 1. g-r(upper plot) and r-z (bottom plot) synthetic galaxy 
clusters color-magnitude for redshift slices ranging from z=0.1 to 
z=0.4 (g-r) and from z=0.5 to z=1.2 (r-z). 



2.3 Cluster determination 

Computing the probabilities for each galaxy in the survey 
is only the first step. In this section, we describe how we 
select, merge and purify the list of cluster candidates. 

First, we determine the probability threshold for cluster 
candidates as follows. For each redshift slice, we first com- 
pute the background probability level and its dispersion a 
by fitting a Gaussian to the whole probability distribution 
(for the lowest redshift slice, z=0.1, we use a log-normal 
to better fit the asymmetric probability distribution). The 
background level and a are set to the first and second mo- 
ment of this Gaussian. Then, we select only those galaxies 
that have probability larger than 3cr above background. In 
Figure O we illustrate this with the probability distribution 
of a redshift slice of z=0.3. 

After that, for each galaxy we assign the redshift slice 
where its probability achieves a maximum over all the red- 
shift slices. Then, for each redshift slice, we will obtain only 
the galaxies with maximum probability over the threshold. 

Next, we make density maps using the selected galaxies 
for each redshift slice and we define galaxy clusters from 
the density peaks. For each redshift slice, z c , we select the 
maximum peak of probability and set the initial center of 
the candidate cluster on it. We assign z c as the redshift for 
the detection. We find the boundary of the cluster by finding 
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Figure 2. Background probability for a redshift slice of z=0.3. 
The jagged curve is the probability histogram, and the smooth 
curve is a Gaussian fit to it. The solid vertical line refers to the 
background probability while the dotted line indicates the 3<r 
threshold above which we select the galaxies belonging to clusters. 



where the radial probability profile is nearly fiat, specifically 
1 < 0.1. All the galaxies included within this boundary are 
used for the following calculation. We calculate an additional 
redshift estimate from the photometric redshift distribution 
of the galaxies within 1.5 Mpc from this center, z est . We 
fit a Gaussian to the redshift distribution of galaxies in the 
clusters and we adopt the peak value as z es t- 

The cluster richness is estimated with the Acl parame- 
ter. It is the effective number o f L* galaxies in t he clu ster and 
within 1.5Mpc. Simulations in lPostman et all (|2002l ) showed 
that this parameter is empirically related to Na, the Abell 
richness, that is the number of galaxies brighter than mz + 2 
in the central 1.5Mpc, rri3 being the third brightest member 
in the cluster. This richness is redshift-dependent, since it 
depends on the depth of the data. Therefore we have applied 
a correction based on the fraction of luminosity that we miss 
due to the completeness of the survey. This correction can 
be written in the following way: 

f mfaint <f>{m)dm 

acl(z) = a C lAz) z:: 9h \, (17) 

/ (blm)am 

•'"'■bright 

where m co is the detection limit of the survey, and Acl,o and 
Acl are the corrected and uncorre cted richness respectively 
(see ISchuecker fc Boehringerl 1 1998h . 

We run this process iteratively to the next maximum 
peak that has not been previously assigned to a cluster until 
there are no more galaxies above the threshold not assigned 
a cluster. With this preliminary list of clusters, we recenter 
the detection on the BCG candidate by iteratively looking 
for the brightest cluster member within 1.5 Mpc and within 
a redshift distance from the resdhift slice of ^(1 + z). From 
this new center, we recalculate z es t as before, keeping the 
boundary fixed. 

Finally, we filter the preliminary list of cluster detec- 
tions. First, we check that the redshift which maximizes the 
probability is consistent with the photometric redshift of the 
member galaxies. Thus, we selected as real clusters those 
that meet the following requirement, 



tions due to the photometric errors that can make a cluster 
be detected in multiple redshift slices. Thus, when two or 
more clusters within two consecutive redshift slices are at 
transverse distance less than 1.5 Mpc, we just merge them 
into the one with highest probability. 



3 SIMULATIONS 

We performed simulations to test the accuracy of the al- 
gorithm and quantify the completeness and purity of the 
results. 

We created catalogues of galaxies, mimicking the ob- 
served properties of the simulated survey. For each richness, 
we simulated a 1 degree field with a cluster at z—0.1, a 1 
degree field with two clusters at z=0.2 and z=0.3 and a 1 
degree field with nine clusters at z—OA to z=1.2. We simu- 
lated nine different richnesses: Acl ~W, 20, 30, 40, 50, 75, 
100, 150 and 200. 

To mimic photometric redshift errors, we spread each 
cluster's galaxy population in a Gaussian with a = 0.01(1 + 
2 C ) and fi = z c . The total number of galaxies is obtained 
assuming that the cluster luminosity function has a slope 
q and characteristic magnitude M* of -1.05 and -20.44 re- 
spectively. We set the detection magnitude limit as the one 
measured from the simulated survey. The density distribu- 
tion follows a Plummer profile as in equation (J5J . We spread 
the magnitudes according to diff erent luminosity func tions 
for different spectral types as in iNakamura et al.l (|2003h ex- 
trapolated to higher redshift. We added to each magnitude 
a photometric error measured from the simulated survey. 

The distribution of the two colors that sample the 
4000A break at low and high redshift were gen erated by us- 
ing th e empirical cluster color distribution bv lBaldrv et al.l 
(2004), including noise. We shifted their colors to our es- 
timated colors obtained fro m the template spectr a set of 
E/S0, Sbc, Scd, and Irr by IColeman et all (Il980l) an d of 
starbursting galaxies SB3, SB2 bv lKinnev et al. I (| 19961 ) and 
an empirical slope from one of the well-known galaxy clus- 
ters in the survey, as explained in section 2.2. In Figure |3j 
we show the simulated g — r and r — z CMR for a Acl = 100 
cluster at redshift 0.2 and 0.5 respectively. 

Then, we embedded these simulated clusters into a dis- 
tribution of field galaxies. In order to add the most realis- 
tic background field galaxies in the simulation, we used the 
properties of the background field galaxies in the real sur- 
vey data. To obtain the survey background distribution, we 
detected clusters in the survey and masked them. Then, we 
redistributed the positions of the remaining galaxies by as- 
suming an angular two-point correlation function similar to 
the galaxies in the real universe. We used the Rayleigh-Levy 
galaxy pair separation distribution: 



P(> O) = {O/9 o )- d tfO>0 o 



(19) 



\z c — Zest | ^ Az(l + Z c 



(18) 



By doing this, we exclude ~30% of the initial detections. 
Additionally, a single cluster can result in multiple detec- 



We used dp = 3.1 9 " and d = 0.8 extracted from the 
work by iGrazian et alj (120061) for the G O ODs S urvey . Fol- 
lowing the approach o f iPostman et al.l (1 19961 . 120021 ) and 
iMenanteau et alj ((2009J), we allow up to seven galaxies to 
be so distributed about a given center, following a random 
walk with this angular distribution. Once the seven galax- 
ies are generated, we choose at random a different center 
until the number of galaxies matches the observed number, 
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Figure 3. Upper plot: r — z CMR for a z=0.5 simulated cluster 
with A CL =100. Bottom plot: g — r CMR for a z=0.2 simulated 
cluster with Ac^=100. 



normalized to match the simulated survey counts. The orig- 
inal magnitudes, morphological types, colors and photomet- 
ric redshifts of the field galaxies are preserved. Only the 
positions are changed. 



3.1 Completeness and purity rates 

We examined which simulated clusters were detected and 
which detected clusters were not simulated. We define a re- 
covered cluster as one which was simulated and then de- 
tected. 

We define completeness as the rate of recoveries, that is, 
the ratio of recovered clusters to the total number of clusters 
simulated, and we define the purity as the ratio of recovered 
clusters to the total number detected. 

In Figure 2] we show the completeness and purity re- 
spectively as a function of redshift and Acl- The complete- 
ness is >90% for all richnesses, at all redshifts for z ^ 1.2, 
the highest redshift simulated. The purity is over ~ 80% for 
clusters with Acl zZ 20 for z ^ 1.2. In other words, galaxy 
clusters richer than Acl ?z 20 are almost 100% recovered, 
with at most 20% of extra false positive detections for z 
<1.2. 

Additionally, we performed different sets of simula- 
tions to prove the robustness of these simulations. We have 
tested possible observational effects such as contamination 
or misidentification of close clusters in projected and red- 
shift space, deviations from the simulations density profile 
or different choices of the cluster and core radius. Simula- 
tions show that most of these effects are negligible, proving 
the robustness of the method. The only non-negligible effect 
is if the real profile of the cluster is too different from the one 
employed to carry out the fit, which causes a maximum 10% 
decrease in purity for poor (Acl ^ 3 0) clusters when the 
true profile is a Navarro- Frenk- White (|Navarro et al.lll997f ) 
or a random tophat profile. 




Figure 4. Completeness (bottom plot) and purity rate (top plot) 
for the simulated clusters with Acl greater than 10, 20, 30 and 
40 (solid, dotted, dashed and dotted-dashed line respectively). 



In view of these results, to keep the completeness and 
purity rates over 80%, we will consider real clusters detec- 
tions those with Acl ^ 20 for z ^ 1.2. 



3.2 Effect of the prior 

The universality of the red sequence in clusters has not been 
prov ed. According to the main scenarios of galaxy forma- 
tion l|White fc Frenk|[l99ll ) , we would expect to find clusters 
without red sequence in the low mass regime or high red- 
shift range. In order to take those clusters into account, we 
performed some sanity checks. 

First of all, we checked how well we are recovering the 
galaxy clusters without the introduction of the prior. In Fig- 
ure [5] we show the same completeness and purity fraction 
as in Figure [4] but just for the likelihood. The results are 
very similar. The completeness of the detections is approx- 
imately the same as when using the whole probability. We 
find a completeness higher than 90% for all richnesses and 
redshifts. The purity decreases by less than ~10% for z<0.6 
and Acl ^20 . Therefore, we do see that for a sample of 
galaxy clusters showing with a predefined red sequence for 
all of them, the results of using the prior or not would be 
very similar. 

Second, we consider a more realistic mix of clusters, 
i.e: galaxy clusters with and without red sequence in the 
same simulation. We performed another set of simulations 
considering half of the clusters with and half of the clusters 
without red sequence. Since the background probability is 
calculated from all the galaxies in the survey including those 
in clusters, the background could decrease slightly due to the 
lower values of probability in the galaxy cluster without red 
sequence and, therefore, we might have obtain more spurious 
detections. In Figure[6] we show the completeness and purity 
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Figure 5. Completeness (bottom plot) and purity rate (top plot) 
for the simulated clusters with A-cl by using only the likelihood. 
The meaning of the lines are the same as in Figure [4] 



Figure 6. Completeness (bottom plot) and purity rate (top plot) 
for the simulated mix of clusters. The meaning of the lines are 
the same as in Figure [4] 



rates of the whole distribution and in Figure [7] the same but 
just for the clusters without the red sequence. The results 
are very similar for the whole distribution of clusters with 
only the completeness decreasing <10% as redshift increases. 
However, when we consider the galaxy clusters without red 
sequence the purity decreases to 60% for galaxy clusters with 
A ^ 20. In this case, we define purity as the ratio of recovered 
clusters without a red sequence to the total number detected 
excluding the total number detected with a red sequence. 

We show in Figure |SJ the results of using just the like- 
lihood for those clusters without a red sequence generated 
in the mixed simulation. The rates keep approximately the 
same since those galaxy clusters have been simulated with- 
out a red sequence by construction. In summary, the inclu- 
sion of the prior in the main probability and the considera- 
tion of the photometric redshifts allow the method to obtain 
high purity and completeness rates, being able to detect al- 
most every cluster richer than Acl ^ 20 for z ^ 1.2 with 
or without red sequence with a very low contamination rate 
(20% at most). 



3.3 Cluster property recovery 

We compare the simulated cluster properties with the recov- 
ered ones. In Figures [9l 1101 and llll we show the offset in the 
recovery of the position, redshift and richness respectively, 
as a function of redshift and richness. 

The positions are quite well recovered, with offsets from 
100 kpc for the most massive clusters up to 500 kpc for the 
least massive ones. Redshifts are very well determined (be- 
tween 0.001 difference for the most massive clusters and up 
to 0.008 for the least massive clusters). Finally, the rich- 
nesses are slightly overestimated, especially for very poor 
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Figure 7. Completeness (bottom plot) and purity rate (top plot) 
for the clusters without red sequence of the simulated mix of 
clusters. The meaning of the lines are the same as in Figure [4] 



clusters. This behavior is expected as the output cluster is 
recovering some contamination from the field galaxies. 

In general, we find a good agreement between the prop- 
erties of the simulated and recovered clusters, only finding 
a higher misalignment and richness overestimation for the 
very poor clusters (Acl ^ 20) . 
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Figure 8. Completeness (bottom plot) and purity rate (top plot) 
for the clusters without red sequence of the simulated mix of 
clusters, using only the likelihood. The meaning of the lines are 
the same as in Figure [4] 
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Figure 9. Position offset in the recovery of the clusters as a 
function of the redshift and Aql- 




Figure 10. Redshift offset in the recovery of the clusters as a 
function of the redshift and Aql- 




Figure 11. Aql richness offset in the recovery of the clusters as 
a function of the redshift and Acl ■ 



4 CLUSTERS IN CFHTLS-ARCHIVE 
RESEARCH SURVEY 

4.1 Description of the sample 

The Canadian-France-Hawaii Telescope Legacy Survey 
(CFHTLS) Deep and Wide consists of 4 and 171 degree 
squares respectively in five optical bands (ugriz). The com- 
pleteness of this survey is up to 26 (CFHTLS-D) and 24.5 
(CFHTLS-W) in r band. The combination of both width 
and depth of this survey makes it an excellent sample to 
test our algorithm to d etect clusters. 

lErben et all (|2009h introduced the CFHTLS- Archive- 
Research Survey (CARS), which is based on the public 
archive images from the Wide CFHTLS. They created multi- 
color catalogues, an d estimated photometric redshifts with 
BPZ llBemtezl feoOO) in almost 37 square degrees, 21, 5 and 
11 square degrees in Wl, W3 and W4 respectively. 

The BPZ provides a quality indicator called odds, which 
is denned as the integral of the redshift probability dis- 
tribution within a given interval centered on the recov- 
ered Bayesian photometric redshift. This parameter is useful 
to discard objects with unreliable photo-z. We selected all 
galaxies with odds parameter > 0.70 as a compromise be- 
tween keeping the maximum number of galaxies in the sur- 
vey and avoiding catastrophic outliers. We consider as galax- 
ies those objects with stellar galaxy parameter less than 0.2 
and magnitude less than 21 and all of them for magnitude 
greater than 21. 

In Figures [12] and 1131 we show the galaxy magnitude 
and photometric redshift distribution for Wl, W3 and W4. 
The redshift distribution peaks at ~ 0.6 and the magnitude 
distribution at m r ~ 25, this being the limit we are going 
to consider as the completeness of the data. 



4.2 Cluster detection 

We detected galaxy clusters in Wl, W3 and W4 from the 
CARS catalogues. We obtained 735, 171 and 340 clusters 
with Acl 20 and z ^ 1.2. This corresponds to 35, 34.2 
and 30.91 per square degree. We now compare those cluster 
candidates with different detections found by other works. 
We should keep in mind though that the way that each 
work merges the detections is different in each case. As an 
example, if we assumed 1 Mpc instead of 1.5 Mpc as the 
maximum distance for the detections to be considered the 
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Figure 12. CARS Magnitude distribution 




Figure 13. CARS Photometric redshift distribution 



same, we would end up with 42.38, 40.4 and 38 detections 
per square degree. Or, if we did not apply any merging pro- 
cedure to the detection, we would obtain 51.24, 52.6 and 
46.55 detections per square degree. This is an issue when 
using galaxy cluster counts for cosmological purposes. 



4-2.1 Comparison with Olsen et al. 2007 

lOlsen et afl (|2007t > (hereafter O07) detected 46 clusters 
in Deepl, which overlaps with one square degree in Wl. 
We detected 53 clusters in the common area of Wl. We 
matched our detections to their detections by using a 
Friend of Friends ( FoF) Algorithm (jHuchra fc Gelled 1 19821 ; 
iBotzler et al.ll2004h . For each of our detections, we searched 
for their O07 friends. These friends have to fulfill that they 
are within a comoving distance of 4 Mpc considering the er- 
rors in the redshift measurement. We calculated the redshift 
errors by measuring the width of the Gaussian that we fit 
to the photometric redshift distribution in order to obtain 
z est ■ Then, we searched for friends of friends until no more 
friends are found. Of the final list of friends, we select the 
one with the closest photometric redshift to the original de- 
tection. Finally, we consider a cluster to have a counterpart 
in B if their best friend is within a comoving distance of 4 
Mpc. By matching A with B we are asking whether each 
cluster in A has at least one counterpart in B, so it is not 
the same as matching B with A. We allow a cluster from A 
to match more than one cluster in B since every different 
study follows different merging algorithms. 

In Figure 1141 we show the relation between the comov- 
ing distance and fraction of matched detections. In the upper 
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Motch Ascoso to Olsen 



Match Olsen (AB) to Ascoso 
Match Ascaso to Olsen (AB) 
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Dist (Mpc) 

Figure 14. Comoving distance between detections versus frac- 
tion of detections that matches O07 to this work (solid line) and 
this work to O07 (dotted line). The upper plot considers the whole 
O07 sample, whereas the bottom plot only considers the A and 
B systems by O07. 



plot, the whole O07 detections are included whereas in the 
bottom plot, only the O07 subsample of 32 clusters, which 
excludes all the C or D systems. (In O07's notation, C sys- 
tems are those which do not show any clear galaxy overden- 
sity at examining the optical image, D grade is assigned to 
those systems which were detected due to artefacts such as 
an image edge or a lack of masking. Additionally, A systems 
refer to clear concentration of galaxies with similar colors 
whereas B systems are defined to appear as an overdensity 
of galaxies, less concentrated than A systems or without 
an obvious color concentration). We can see that with a 4 
Mpc distance we find that 85% of our sample matches O07, 
whereas only 70% of O07 matches our sample. However, if 
we just consider the subsample of A and B systems by O07, 
we find a fraction of 80% matching within 4 Mpc for both 
samples. 

In Figure 1151 we plot the difference between the red- 
shift of the matched clusters with respect to the redshift 
estimated in this work. The top panel refers to the matches 
using all the detections in O07, whereas the bottom panel 
only refers to the detections excluding C and D systems in 
O07. We find a good agreement in redshift at any redshift 
with a slight tendency of zoisen — ZAscaso <0 at ZAscaso > 1. 

Additionally, we matched the catalogs in the reverse or- 
der. By doing this, we want to know how pure our detections 
are with respect to the other catalogue. We used the same 
criteria as previously explained to create the matches. We 
show the difference between the redshift of the matched clus- 
ters with respect to the O07 redshift estimations in Figure 
1161 As before, the top panel refer to the matches using all 
the detection in O07, whereas the bottom panel only refers 
to the detections excluding C and D systems in O07. We 
see a very good agreement between both redshifts at any 
redshift. In the rest of the paper, we include the C and D 
systems. 

In order to take into account the possible biases in the 
way that each work merges their detection by using different 
criteria, we created density maps of the cluster detections. 
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Figure 15. Redshift difference between a cluster candidate found 
in this work and their matched counterpart from O07, with re- 
spect to the redshift estimated in this work. The top panel refers 
to the matches using all the O07 catalogue, whereas the bottom 
panel excludes the C and D systems from O07 catalogue 
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Figure 17. Density maps of the cluster detections in this work 
in the area of CARS that overlaps with the Deep 1 field (color 
maps). The black contours correspond to the level curves of the 
density maps for the O07 catalogue. 
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Figure 16. Redshift difference between a cluster candidate from 
O07 and their matched counterpart in this work, with respect to 
the O07 redshift. The top panel refers to the matches using all 
the O07 catalogue, whereas the bottom panel excludes the C and 
D systems from O07 catalogue. Note the reversal of sign of the 
residuals compared to Fig. 1151 




12.36 9.27 6.18 3.09 0.00 



o 




02 h 24 m 30 s 26 m 00 s 27 m 30 s 

Right Ascension 



Figure 18. The same as in Figure 1171 but the black contours 
correspond to the level curves of the density maps for the O07 
systems classified as A or B. 



Each detection is represented by a a Gaussian centered on 
the position of the cluster candidate, with a width of 1 Mpc 
radius at the redshift of the cluster. In Figure 1171 we show 
the density map for the detections found in this work with a 
color map. Additionally, we overplot the level curves of the 
density maps for the cluster candidates by O07 in Deep 1. 
We see a very good agreement between both detections. The 
main two prominent structures in the upper left and upper 
right are well identified in both cases. We also identify the 
four subclumps at the bottom. The only visible difference is 
the more elongated structure that O07 finds in the bottom 
left of the image and that we do not detect. 

However, when we overplot the level curves of the den- 
sity maps only for the A and B systems, or those which have 
been visually classified as having a clear cluster-like struc- 
ture, this elongation becomes smaller as shown in Figure 
1181 We find very similar structures as found in O07, even if 
the latter are based on Deep 1 data, which is almost ~ 1 
magnitude fainter than the Wide Fields used in this work. 



4-2.2 Comparison with Adami et al. 2010 

Recently. I Adami et al.l l|201Cj ) (hereafter A10) published de- 
tections in 19 out of 21 degrees of Wl, 4 out of 5 degrees in 
W3 and 2 out of 11 degrees in W4. Their method consisted 
of an adaptive kernel technique combined with SExtractor 
for the detection of the structures. They used the T0004 cat- 
alogues from the CFHTLS. They also ob tained photometri c 
redshifts by using the Le Phare software |llbert et al.l feOQG'). 
Thus, the direct comparison between both methods could be 
affected by the systematics of the different catalogues and 
methods used in the data. 

We obtain 682, 138 and 70 clusters in the common area 
of Wl, W3 and W4. They detected clusters over 2 and 3a, 
where the a is estimated from SExtractor. Over 2a, they 
obtain 755, 175 and 99 detections. AlO's 3a catalogs con- 
tain 441, 130 and 31 cluster candidates. As we explained at 
the beginning of the section, the numbers are not directly 
comparable since each work uses a different merging crite- 
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Figure 19. Comoving distance between detections versus frac- 
tion of detections that matches A10 in Wl to this work (solid 
line) and this work to A10 in Wl (dotted line). The upper plot 
considers the A10 sample in Wl over 3<r, whereas the bottom plot 
considers the A10 sample in Wl over 2a. 
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Figure 21. Comoving distance between detections versus frac- 
tion of detections that matches A10 in W4 to this work (solid 
line) and this work to A10 in W4 (dotted line). The upper plot 
considers the A10 sample in W4 over 3<r, whereas the bottom plot 
considers the A10 sample in W4 over 2a. 
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Figure 20. Comoving distance between detections versus frac- 
tion of detections that matches A10 in W3 to this work (solid 
line) and this work to A10 in W3 (dotted line). The upper plot 
considers the A10 sample in W3 over 3a, whereas the bottom plot 
considers the A10 sample in W3 over 2a. 

ria. However, our threshold appears to be somewhat closer 
to their 2a threshold than their 3<r threshold. 

We matched the detections found in this work to their 
detections using the same criteria as in the previous section. 
In Figures 1191 1201 and 1211 we show the maximum comoving 
distance for matching two detection versus the percentage 
of detections for the A10 sample over 3 and 2a respectively 
for the overlapping area with Wl, W3 and W4. In all cases 
the fraction of matched detections become higher than 80% 
for distances less than 4 Mpc. 

In Figures [22l 1231 and 1241 we show the redshift difference 
between the cluster candidates in this work and the matched 
A10 candidates in Wl, W3 and W4 respectively over 3 and 
2cr respectively, as a function of our redshift estimate. The 
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Figure 22. Redshift difference between the cluster candidates 
in this work and matched clusters from AlO's catalogs for the 
common area in Wl. The upper and bottom plot are matched to 
the A10 detections over 3 and 2 a respectively. 



redshift difference is consistent with zero for redshift z<l.l 
and z Adami — ZAscaso tends to be systematically negative at 
very high ZAscaso (zAscaso >1.1). However, the dispersion is 
much smaller when compared to the A10 2a detections, in 
particular in W4. 

Again, we reversed the order of matching to check on 
any artifacts of the matching process. In Figures [231 1261 and 
1271 we show the redshift difference between the cluster can- 
didates in this work that match those A10 candidates for 
Wl, W3 and W4. 

We find some cases in all the fields where the redshift 
difference for the matched clusters to the A10 catalogue is 
negative at higher zaio >1- There are a few cases where 
the difference is positive at zaio <0.3 as in W3. Since the 
redshift errors are not available in A10, we are not taking 
them into account when matching the detection in this work 
to their detection. This might produce a misidentification of 
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Figure 23. Redshift difference between the cluster candidates 
in this work and matched clusters from AlO's catalogs for the 
common area in W3. The upper and bottom plot are matched to 
the A10 detections over 3 and 2 a respectively. 
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Figure 26. Redshift difference between the cluster candidates in 
A10 and their matched cluster candidate from this work in the 
common area in W3. The upper and bottom plot are detections 
matched to A10 catalogues over 3 and 2 a respectively. 
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Figure 24. Redshift difference between the cluster candidates 
in this work and matched clusters from AlO's catalogs for the 
common area in W4. The upper and bottom plot are matched to 
the A10 detections over 3 and 2 a respectively. 
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Figure 27. Redshift difference between the cluster candidates in 
A10 and their matched cluster candidate from this work in the 
common area in W4. The upper and bottom plot are detections 
matched to A10 catalogues over 3 and 2 a respectively. 
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Figure 25. Redshift difference between the cluster candidates in 
A10 and their matched cluster candidate from this work in the 
common area in Wl. The upper and bottom plot are detections 
matched to A10 catalogues over 3 and 2 a respectively. 



the right candidate. Other than that, we find a good agree- 
ment between both detection. 

As before, we created the density maps of the detections 
found in this work for the common area and compared them 
with contour levels for the detections found by A10 over 3 
and 2(j in order to avoid systematics due to different merging 
procedures. In Figures [28ll29l and l30l we show the detections 
for the common area of Wl, W3 and W4. Generally, the 
large structures are well matched in all fields. In general, 
there is a better agreement with A10 2a detections. 

As a conclusion, a direct comparison of our work with 
A10 in data with similar depth shows a larger number of 
structures in our analysis, which are detected over 2a in the 
work by A10 but not over 3a. In addition, there are some 
more structures that we find in this work and are not found 
in A10, which seem to be real (see section 5). 

Eventually, we matched the detections from A10 to O07 
catalogue using the same criteria. In Figure |3"T1 we show the 
redshift difference between the detections in O07 matched 
to the detections in A10 over 3 and 2a respectively, with 
respect to the O07 cluster redshift. We find a very good 
agreement for all redshifts. By matching the O07 catalogue 
to the A10 catalogue detections, we also find a good agree- 
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Figure 28. Density maps of the cluster detections in this work in 
the area of CARS that overlaps with the area analyzed by A10 in 
field Wl (color maps). The black contours correspond to the level 
curves of the density maps from the A10 catalogue. Upper and 
bottom plots show the level curves over AlO's 2 and 3cr catalogs 
respectively. 



Figure 29. Density maps of the cluster detections in this work in 
the area of CARS that overlaps with the area analyzed by A10 in 
field W3 (color maps). The black contours correspond to the level 
curves of the density maps from the A10 catalogue. Upper and 
bottom plots show the level curves over AlO's 2 and 3cr catalogs 
respectively. 



merit as can be seen in Figure [32] The dispersion increases 
at higher z A dami- 

4-2.3 Comparison with X-ray data and spectroscopically 
confirmed sample 

We checked the agreement between the clusters that we de- 
tected iri_tlie_op_tical, and X-ray detections found in these 
fields. iPacaud et all |2007l ) published 29 spectroscopically 
confirmed clusters in the XMM-LSS survey, 17 of which are 
in Wl. Additionally, O07 made a compilation of 18 X-ray 
spectroscopically con firmed clusters in th e same fields, of 
which 8 appear in the lPacaud et afl |2007l ) sample. 

Of these 25 clusters, we matched them to our sample 
by following the same criteria as before and we detected 
23 within a minimum comoving distance of less than 4 Mpc 
distance. Even if this result is promising, the results might be 
biased since almost all the spectroscopy sample is selected to 
be in the high end of the mass function for clusters, making 
these results easier to be detected. 

We find an excellent agreement between the redshift 
matches as we see in Figure [33] The median and dispersion 
of the redshift differences is 0.0024 and 0.088, well within 
the predicted errors. 

We also matched the sample of spectroscopically de- 



tection to O07 and A10 over 3a. In Figures l34l and l35l we 
show the redshift differences for the matched candidates to 
the sample. A10 finds all 25 galaxy clusters within 5 Mpc, 
where O07 finds 22. Let's note that AlO's catalog is ~ 1 mag 
shallower than O07. However, the dispersion of the redshift 
difference matching is 0.058 and 0.1238 for A10 and O07 
respectively. 



5 DISCUSSION AND CONCLUSIONS 

We presented a new method based on a Bayesian approach 
of the matched filter technique, including the introduction 
of a prior containing the color-magnitude information, the 
BCG-redshift relation and the photometric redshift. 

We performed simulations to test how well our clusters 
are recovered. We simulated galaxy clusters showing red se- 
quence at different redshift and with different richnesses. 
We found very high completeness rates (> 90%) and purity 
rates for clusters with Acl ^ 20 (> 80%) up to redshift 1.2 
at least. The results were very similar with and without the 
inclusion of the prior. This result suggests that galaxy clus- 
ters showing a red sequence can be found without the need 
to model the color-magnitude red sequence. In addition, the 
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Figure 30. Density maps of the cluster detections in this work in 
the area of CARS that overlaps with the area analyzed by A10 in 
field W4 (color maps). The black contours correspond to the level 
curves of the density maps from the A10 catalogue. Upper and 
bottom plots show the level curves over AlO's 2 and 3cr catalogs 
respectively. 
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Figure 32. Rcdshift comparison between the O07 cluster and 
their matched detections from A10 in the area that overlaps with 
Dl over 3 and 2a (upper and bottom plot respectively). 




Figure 33. Redshift comparison between the X-ray spectroscop- 
ically detected clusters and our detections. 




Figure 31. Redshift comparison between the detections from 
A10 over 3 and 2a and their matched detections by O07 (upper 
and bottom plot respectively). 



completeness and purity rates are high for a wide range of 
masses and redshifts. 

It is still not clear if every galaxy cluster exhibits a 
red sequence. Different methods (X-ray, SZ, weak lensing) 
have been finding these clusters independently of their colors 
in the high mass end and moderate redshift. Almost all of 



them appear to have a well defined red sequence. However, 
we do not know if this is the case at the low mass end of 
the cluster distribution or at high redshift. Hence, we tested 
the method on a mix of clusters, ie: galaxy clusters show- 
ing and not showing a red sequence. The results could be 
different since when running the algorithm globally, we de- 
termined a background probability for each redshift slice for 
the whole field. This background probability is determined 
by the probability distribution of the galaxies that exist in 




Figure 34. Rcdshift comparison between the X-ray spectroscop- 
ically detected clusters and O07 detections. 
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Figure 35. Redshift comparison between the X-ray spcctroscop- 
ically detected clusters and A10 detections. 
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Figure 36. Cross-correlation as a function of angular separation 
of the cluster distribution in this work (solid line), between this 
work and A10 (dotted line; over 3a, dashed line; over 2a) and 
between this work and O07 (dashed-dotted line). We used the LS 
estimator. 



the survey, including those galaxies in clusters. In this case, 
the galaxy clusters not showing a red sequence could af- 
fect the main background probability, making the non-red 
sequence clusters tail not to pass this threshold or, on the 
contrary, including spurious detections in the sample. 

The results of completeness and purity for the whole 
mix of clusters are very similar to the red sequence galaxy 
cluster simulations, with a slight decrease of purity (but still 
> 80%) at higher redshift. As expected, the introduction of 
the prior in these simulations does not make a difference 
since those clusters are, by construction, not showing a red 
sequence. 

We applied this algorithm to the CARS Survey 
ijErben et al]|2009r i based on the CFHTLS Wide data and 
compared the results with the different works in the optical 
in the area corresponding to Dl by O07 and Wl, W3 and 
W4 by A10. 

It is hard to directly compare different works since every 
work uses a different merging and centering algorithm to 
obtain the final cluster catalogue. We calculated the cross- 
correlation between A10 and O07 and our work and it is 
shown in Figure|36l bv using the Landy-Szalay estimator (LS; 
lLandv fc Szalavll993h . We plot the auto correlation function 
as a reference. The cross-correlation function between two 
different sub-samples measures the excess probability over 
random of finding a cluster in the second sample at a given 
separation from a cluster belonging to the first sample. We 
find that both samples have a similar amplitude, being the 
clusters found by A10 over 3a the most correlated with the 
ones in this paper at smaller scales. 

In Figure I37I we show the redshift distribution of the 
detections found by O07, A10 and this work. O07 finds a 
peak of detections at z ~ 0.4, which are mostly low mass 
structures detected in their deeper data. Apart from this, 
the redshift detection distribution in this works agrees well 
with the redshift distribution from O07 and with the detec- 
tions found by A10 over 2a better than 3a. However, we find 
a peak of detections at z ~ 1 that the other works do not 
find. Many of these cluster candidates seem to be real by 
looking at the images as shown in Figure [38] In fact , some 
of them were detected with X-rays in iPacaud et all (|2007l ) 
as the first cluster in Figure 1381 They provided a redshift 
estimate of 1.05 for this cluster, whereas we obtain a red- 
shift estimate of 1.01. These detections will be confirmed by 
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Figure 37. Redshift distribution for the clusters detections found 
in this work (solid line), O07 (dotted line) and A10 over 2 and 3a 
(dashed and dashed-dotted lines). 



obtaining spectroscopic information or deeper data in these 
regions. 

In summary, we find a good agreement with the galaxy 
clusters that we find in CARS and other detections found in 
similar or deeper data of the same fields. In addition, we find 
several detections at high redshift which are not reported 
and appear to be real based on visual characteristics. We are 
currently applying this algorithm to the Deep Lens Survey 
(DLS), (Ascaso et al. in preparation) in order to provide the 
first optical galaxy cluster in the survey. 
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